  1.
    The Challenges of Large‐Scale, Web‐Based Language Datasets: Word Length and Predictability Revisited. Stephan C. Meylan & Thomas L. Griffiths - 2021 - Cognitive Science 45 (6):e12983.
    Language research has come to rely heavily on large‐scale, web‐based datasets. These datasets can present significant methodological challenges, requiring researchers to make a number of decisions about how they are collected, represented, and analyzed. These decisions often concern long‐standing challenges in corpus‐based language research, including determining what counts as a word, deciding which words should be analyzed, and matching sets of words across languages. We illustrate these challenges by revisiting “Word lengths are optimized for efficient communication” (Piantadosi, Tily, & Gibson, (...)
    5 citations
  2.
    Zipfian frequency distributions facilitate word segmentation in context. Chigusa Kurumada, Stephan C. Meylan & Michael C. Frank - 2013 - Cognition 127 (3):439-453.
    10 citations
  3.
    Evaluating models of robust word recognition with serial reproduction. Stephan C. Meylan, Sathvik Nair & Thomas L. Griffiths - 2021 - Cognition 210 (C):104553.
    Spoken communication occurs in a “noisy channel” characterized by high levels of environmental noise, variability within and between speakers, and lexical and syntactic ambiguity. Given these properties of the received linguistic input, robust spoken word recognition—and language processing more generally—relies heavily on listeners' prior knowledge to evaluate whether candidate interpretations of that input are more or less likely. Here we compare several broad-coverage probabilistic generative language models in their ability to capture human linguistic expectations. Serial reproduction, an experimental paradigm where (...)